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Abstract 

This paper discusses the gambhng contest introduced in Seel & Strack [6] 
and considers the impact of adding a penahy associated with failure to 
follow a winning strategy. 

The Seel & Strack model consists of n-agents each of whom privately 
observes a transient diffusion process and chooses when to stop it. The 
player with the highest stopped value wins the contest, and each player's 
objective is to maximise their probability of winning the contest. We give 
a new derivation of the results of Seel & Strack fS] based on a Lagrangian 
approach. Moreover, we consider an extension of the problem in which in 
the case when an agent is penalised when their strategy is suboptimal, in 
the sense that they do not win the contest, but there existed an alternative 
strategy which would have resulted in victory. 



1 Introduction 

In [5] , Seel & Strack introduced a model of a gambling contest between agents 
in which the objective of each agent is not to maximise her return, but rather to 
maximise the probability that her return is highest amongst the set of agents. 
One motivation for studying such a problem is that it provides a stylised model 
for competing fund managers, only the most successful of whom will be given 
funds to invest over the next time period. Another distinct strand of the litera- 
ture on modelling competition between fund managers is represented by Basak 
and Makarov [2]. 

In the Seel & Strack paper each agent observes a stochastic process and 
chooses a stopping time to produce a stopped value. The agent wins the contest 
if her stopped value is greater than the stopped value of every other agent, 
and the objective of the agent is to maximise the probability that she wins the 
contest. Our aim is to add a behavioural finance aspect to the problem. Again 
the objective of the agent is to maximise her chances of winning the contest, 



•Department of Statistics, University of Warwick, Coventry, CV4 7AL, UK. 
H.Feng@warwick.ac.uk 

^Department of Statistics, University of Warwick, Coventry, CV4 7AL, UK. 
D . Hobson® Warwick, ac. uk 



1 



but now she is penalised if she has not won the contest, and she has behaved 
sub-optimally, in the sense that there was an alternative strategy which would 
have led to her winning the contest. Thus a fund manager who has followed 
a poor strategy is not merely given a new role within the firm, but instead is 
terminated with disgrace. 

Although the problem described in [B] is very simple, the solution is re- 
markably rich and subtle. Firstly, in equilibrium agents must use randomised 
strategies, so that the level at which the agent should stop is stochastic. Sec- 
ondly, the set of values at which the agent should stop forms an interval which 
is bounded above. Several variants are discussed in Seel & Strack, including 
the extension to the asymmetric case where the starting values of the processes 
observed by the agents are different. 

We will consider the following variant of the problem. The agent's choice 
of stopping rule determines her stopped value. But if with hindsight we look 
at the best possible time she could have chosen then we get a maximum value 
she might have attained. We consider a problem in which the agent receives 
a reward of 1 if her stopped value is the highest across all agents, but she is 
penalised K if her stopped value is not highest, and if she had stopped at the 
maximum value she might have attained then she would have been the winner. 
If she is not the winner, and there is no strategy she might have followed which 
would led to her being the winner then her score is zero. 

In fact we consider three variants of the problem, in which an omniscient 
being (or the agent's supervisor) penalises the agent for stopping too soon, for 
stopping too late, or for stopping too soon or too late. In the first case the agent 
faces regret over stopping too soon, and we consider the maximum value to be 
the maximum value attained by her process after the moment she chose to stop. 
In the second case the maximum is taken only over that part of the path which 
occurs before the chosen stopping time and the agent faces regret over stopping 
too late. In the third case we take the maximum over the whole path. 

Our results are that in the first problem, the effect of the penalty is precisely 
equivalent to an increase in the number of opponents. An increase in K incen- 
tives the agent to aim for higher values, at the cost of stopping at low values 
more often. This is the same as the effect of competition from more opposing 
agents. In the second problem, which is both harder and more interesting, the 
optimal strategy is modified in a more subtle way. This case is relevant if the 
agent's process is unobservable from the point at which is is stopped, for in- 
stance if it is the gains from trade process arising from a dynamic investment 
strategy chosen by the agent. Now the agent faces a risk of a penalty whenever 
she stops below the value of the current maximum. For this reason she is reluc- 
tant to do so, although it is also sub-optimal to wait until her process hits zero, 
as this is a sure losing strategy. An increase in K incentives her to stop more 
quickly. The third problem might be expected to be a combination of the two 
previous problems, but in fact there is a natural simplification which leads to 
the optimum being the same solution as the original Seel & Strack problem. 

The remainder of this paper is constructed as follows. In the next section 
we describe the original contest introduced in Seel & Strack [6]. We rederive 
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the Nash equihbrium, using a different approach based on Lagrangian method. 
Then in Section[3]we introduce the problem with regret, which we then solve in 
the three cases described above in Sections HI [S] and HI Finally in Section [7] we 
explain the origin of the optimal multipliers and the candidate Nash equilibrium 
distribution. The ideas of the proofs are different to those in [5] in that instead 
of trying to write down the value function for the agents we use a Lagrangian 
sufficiency theorem. This brings new insights and yields a simpler proof even 
in the standard problem of Seel & Strack and facilitates our analysis in the 
extended problem. 



2 Contest without regret 
2.1 The model 

There are n players with labels i E I — {1,2, . . . ,n} who take part in the contest. 
Player i privately observes the continuous-time realisation of a Brownian motion 

= )tgR+ absorbed at zero with Xq = xq where xq is independent of i and 
positive. Let J^l = cr{{Xl : s <t}) and set F* = {J^l)t>o- 

The space of strategies for agent i is the space of F'-stopping times r*. Since 
zero is absorbing for X^, without loss of generality we may restrict attention 
to r* < Hq = mi{t > : XI = 0}. Player i observes her own process X\ but 
not X^ for j ^ i; nor does she observe the stopping times chosen by the other 
agents. Moreover, the processes X' are independent. 

The player who stops at the highest value wins unit reward, that is, Vi € /, 
player i wins 1 if she stops at time r* such that X^i > X^j Vj ^ i. If there are 
k players who stop at the equal highest value then these players each win i. 
Therefore player i with stopping value X^^ receives payoff 

1 



where k 



i E I : X^i = maxjg/ X, 



The key insight of Seel & Strack [6] is to observe that the problem of choosing 
the optimal stopping time can be reduced to a problem of finding the optimal law 
for Xr or equivalently an optimal target distribution. The payoffs to the agents 
only depend upon t' via the distribution of X^i. Hence, the problem can be 
considered in two stages, firstly find an optimal target distribution F*, and then 
verify that there is a choice of such that X'^^ has law F\ But, the problem 
of finding r such that Xt has law is a classical problem in probability theory, 
and is known as the Skorokhod embedding problem (Skorokhod [7]). Since X 
is a Brownian motion started at xq and absorbed at 0, any distribution on M"'" 
with mean less than or equal to xq can be embedded with a finite stopping time 
r (and conversely, for any r the law of Xr has mean less than or equal to xo). 

Note that there are multiple solutions to the Skorokhod embedding problem 
for F, so there will be several alternative stopping rules which will bring equal 
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probability of success to an agent. However, we expect that the optimal target 
distribution is unique. 

Our aim is to find Nash equilibria for the problem. By the above remarks, a 
Nash equilibria can be identified with a family of distribution functions {F^)i^i. 
We will say that {F^)i^j is a Nash equilibrium if, for each i G /, if the other 
agents use stopping rules such that X^j ^ , then the optimal target 
distribution for agent i is F^, and she may use any stopping rule such that 
X^i ~ F*. We will say a Nash equilibrium is symmetric if does not depend on 
i, and we will say that a Nash equilibrium is atom- free if each is atom- free. 
Given the symmetry of the situation in the sense that each agent observes a 
martingale process started from the same level xg , it seems natural that a Nash 
equilibria is symmetric. Moreover, simple arguments over rearranging mass can 
be used to show that it is never optimal for two agents to put mass at the 
same positive point x — either of them could benefit by modifying the target 
distribution to put a proportion N/{N + 1) of this mass at {x + N^"^) and a 
proportion 1/{N + 1) at (x — iV~^) - and then it is possible to deduce that any 
optimal solution is atom-free. 

Theorem 2.1. [Seel & Strack fB^] Any Nash equilibrium has the property that 
it is symmetric and atom-free. 

Remark 2.1. The fact that the Nash equilibria is atom- free relies on the fact 
that the situation is symmetric in the sense that all agents stop Brownian mo- 
tions started from a common value xq. If the agents observe processes with 
different starting points, then the Nash equilibria may have masses at zero for 
some agents. In that case, for a Nash equilibrium, no agent places mass at a 
positive point, and at least one agent has an atom- free distribution. We will 
only consider the symmetric case. 

Remark 2.2. In the Seel & Strack setting, which we call the standard case, 
we will focus on proving that there exists a unique symmetric, atom-free Nash 
equilibria. It will then follow from the results of Seel & Strack [B] that this is 
the unique Nash equilibrium for our problem. Our methods can be extended 
to show that every Nash equilibrium has the property that it is symmetric and 
atom-free, but we will not present those arguments here. When we consider 
the problem with a penalty for using a losing strategy when a winning strategy 
exists, we will again prove the existence of a unique symmetric atom-free Nash 
equilibrium, but intuition gained from the standard case indicates that this 
equilibrium is unique. 

2.2 Derivation of the equilibrium distribution 

This section is devoted to a proof of the following result, first proved in Seel 
and Strack |B] using different methods. 

Theorem 2.2. There exists a symmetric, atom- free Nash equilibrium for the 
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problem for which X^i has law F(x), where for x > 



F(x) = mill \ — , 1 
I V "^^0 

Proof. Let A be the set of non-decreasing functions / : [0, oo) i— > [0,oo) which 
are null at zero, and let Ad{xq) be the subset of A corresponding to distribution 
functions of random variables with mean xq. Then, 

^oixo) = \ f '■ [0,oo) i-^- [0,oo) such that /(O) = 0,/ non-decreasing, lim f{x) = 1, / 

We seek a symmetric atom-free Nash equilibrium. Since there are no atoms, 
we do not need to consider how to break ties and a symmetric Nash equilibrium 
is identified with a distribution function G* G AD{xa) with the property that 

G*{xT-^G*{dx)> G*{xT-^G{dx) yG^Aoixo). 
Jo 

Suppose that the other players all choose F{x) as their target distribution. 
Then the problem facing the agent is to choose G to solve 

/•oo /'OO pOO 

max / FixY^^^Gidx) subject to / xG(dx) = xq and / G(dx) = 1. 

Gix)€AJo Jo Jo 

(1) 

Introducing multipliers A and 7 for the two constraints, the Lagrangian for the 
optimization problem ([T]) is then 

poo 

Cf{G; a, 7) = / [Fix)"'^ - Aa; - 7] G(dx) + Xxo + 7- 

Now we state a variant of the Lagrangian sufficiency theorem for our problem. 
Proposition 2.1. If G* , A* and 7* exist such that G* e Ad{xo) and 

Cg*{G*;X*,j*)>Cg*{G;X*,^*) (2) 
for all G ^ A, then G* is a symmetric, atom-free Nash equilibrium. 
Proof UGeAoixo) then 



xf{dx) = xo 



G*{xY-^G{dx) = Cg'{G;X*,j*). 
Then, under the hypotheses of the proposition, 

G*{x)"-^G*{dx)=CG'{G*;X*,j*)>CG'{G;X*,-f*)= / G* {x)"-^Gidx). 

□ 
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Return to the proof of Theorem l^?^ On [0,oo) let G*{x) = min | ""{/a;/(nxo), l|, 

A* = l/{nxo) and 7* = 0. We verify that for these multipliers ([2|) holds and 
that G* G Ad{xo). The latter follows immediately from the explicit form of 
G* . For the former 

/>oo 

£g*{G;X*,j*)^ / [G*{xy''^ -X*x-j*]G{dx)+X*xo+j* 







00 



1-^ 

nxQ 



G{dx) + -< - = Cg'{G*;X*,Y)- 
n n 



Thus there exists a symmetric, atom-free Nash equilibrium of the given form. 

□ 

Remark 2.3. Seel & Strack |5] solve the problem by writing down a candidate 
value function for the problem, and then verifying that the candidate value 
function is a martingale under an optimal stopping rule for each agent. 

Remark 2.4. Our results can be extended to the case where the processes 
observed by the agents are independent copies of some time-homogeneous dif- 
fusion process Y which converges almost surely to the lower bound on its state 
space. The idea is to use a change of scale, and, in the setting of Skorokhod 
embeddings, can be traced back to Azema and Yor [T]. In addition to Brownian 
motion (absorbed at zero), canonical examples include exponential Brownian 
motion and drifting Brownian motion with negative drift (with or without ab- 
sorption at zero) . Seel & Strack ^ consider the problem when is a Brownian 
motion with positive drift, absorbed at zero, but then they need to impose a fur- 
ther condition on the model parameters to ensure the finiteness of the candidate 
solution. 

Let the state space of the time-homogeneous diffusion Y be an interval S 
with endpoints {l,r} with —00 <l<YQ = yo<r< 00. Suppose that Y 
is a solution of the stochastic differential equation dY — a{Y)dB + b{Y)dt 
and let s = s{y) be the scale functior0. Then s is an increasing solution of 
a{y)^s"{y) + 2b{y)s'{y) — and X = s{Y) is a continuous local martingale 
with starting value xq — s{yo), and hence a time-change of Brownian motion. 
Moreover, our assumption that Y converges to the lower boundary implies that 
s{l) is finite whereas s{r) = 00 and without loss of generality we may set s{l) = 0. 
Then X = s{Y) converges to zero almost surely (and if zero can be reached in 
finite time, then zero is absorbing). 

Note that s(-) is a continuous strictly increasing function. Hence the payoff of 
player i with stopping value Y^i can be expressed as -^^^x^ =max ej }' where 

k = |i G / : X^i ~ maxjg/ I , and X!^i ~ s(Y^\). Then the contest in 
which players privately observe Y^ is equivalent to the contest in which players 



^If y is exponential Brownian motion, dY = oYdB + bYdt then s{y) = with k = 
1 — 2b/ a^. Note that we need parameters such that k > to ensure that Y is transient 
to zero. If Y is downward drifting Brownian motion, dY = adB + bdt with b < 0, then 
s{y) = e-™ with rj = 2b jo?. 
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privately observe X', and the choice of the optimal r* is the same for both 
problems. In particular, if we have a Nash equilibrium for which is optimal 
for the processes X', then we also have a Nash equilibrium for the processes F*. 

The problem is then to find a Nash equilibrium (G'*)ig/ for Y^^ and then 
verify that there exists r* such that Y^i has law G\ Under our transformation, 
this is the same as finding a Nash equilibrium (i^*)igj for X^i where = 
G* o ,s~^, where is the inverse of s. To solve the problem for X, then either 
we argue that the only properties of X that we use are the strong Markov 
property, the local martingale property, and the fact that X converges to zero, 
so that the theory of this section applies to the local martingale diffusion X, or 
we argue that since X is a non-negative martingale diffusion, X is a time-change 
of Brownian motion and Xf = for some increasing functional Ft. Then if 
F is any distribution with mean less than or equal to xq, and cr is a stopping 
time such that B^^ ~ F, then we may take r = o a and then X^ = -Bo- ~ F 
and r is an embedding of G in F. 

Note that if X^- ~ F and F has mean xi < xq, then there exists {F, f ) such 
that F has mean xq, F > F and Xf ~ F. Clearly f dominates r as a strategy. 
Hence we may restrict attention to stopping times r such that the distribution 
F of X has mean xq, and then (XtAT-)t>o is a martingale and not just a local 
martingale. 

3 Contests with regret 

Our goal is to solve an extended version of the problem in which agents are 
penalised for following losing strategies, if they had an alternative stopping rule 
which would have won the contest. The idea is that thcirc is an omniscient 
judge who can observe the path of X*, and not just the stopped value, and 
who penalises the agent for the failure to use a winning stopping rule if such a 
strategy exists. This judge represents the; super\'iKor of the agent, and the agent 
faces penalties (such as dismissal) in cases where after the fact she is seen to 
have followed a losing strategy, when a winning strategy existed. 

In our model there are n contestants, each of whom privately observes their 
own process X* and the player with the highest stopping value wins unit reward. 
That is, Vi e /, player i wins 1 if she stops at time r' such that X^i > X:^^ 
Vj ^ i. In addition the player is penalised > if her stopped value is not 
highest, and if she had an alternative strategy which would, with the benefit 
of hindsight, have allowed her to win. (The case K = corresponds to the 
standard problem.) Given that the best strategy for agent i is to stop at the 
maximum value attained by X* this means that player i loses K if she stops 
at r* such that X^ < max^^j X^^. < M\ 

As before, ties are broken randomly. If there are k players who stop at the 
highest value then these players each wins p Further, player i loses K2 if she 
stops at r' such that X^j < maxj^j X^^ = M% where < K2 < K. Hence 
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player i who stops at X^i with maximum value M' has payoff 

where fc = ||z S / : = max^g/ 1 1 . 

Our objective is to find a Nash equilibria which is represented by a family of 
stopping mlcs (r'). Since the values (X*i.Af*) are a sufficient statistic for the 
problem, the Nash equilibria can be characterised by the law v"^ of (X*i,M*). 
Then in equilibrium, the agent can use any stopping rule for which (X*i,M*) 
has law . We write for the marginal of v which corresponds to the law of 

r' 

In the standard case, every Nash equilibrium is symmetric and atom-free. 
In our generalised setting we will limit our search to symmetric atom-free Nash 
equilibria, and we will show that there exists a unique such equilibrium. Mo- 
tivated by the situation in the standard case we conjecture that every Nash 
equilibria is syumietric and atom-free and therefore that we have found the 
unique equilibrium. 

Remark 3.1. Since there are no atoms, the probability of a tie is zero. Thus 
neither the method of breaking ties nor the value of will affect our results. 

Suppose that the other players all choose F(x) as their target distribution 
of Xt. Then agent i aims to choose a feasible measure v{x-,y) for (X*i,M*) to 
maximise 

= (1 -h -ff)E {F(XiiY-^\ - iTE [F(M0""^] , 

= / [(1 + K)F{xr-^ - KF{yr-^] j.{dx, dy), (3) 
Jo Jo 

which is her expected payoff when she stops at t\ 

So far we have been imprecise about the definition of M*. The quantity 
represents the maximum the agent could have achieved. Depending on the inter- 
pretation, this could be the maximum over the entire path M* = maxj^^; < 
t < Hg}, or it could be that only that part of the path before the agent's chosen 
stopping time is considered, M* = ma,x{Xl;0 < t < r*}, or only that part of 
the path after the agent's chosen stopping time, = max{Xj;r' < t < H^}, 
where Hq — inf{t e M+ : Xj: = 0}. These different interpretations will lead 
to different Nash equilibria. We consider the three cases separately in the next 
three sections. 



4 Contest with regret over future failure to stop 

In this section we consider the contest in which the agent is penalised for stop- 
ping too soon. We consider the maximum value M* to be defined by 

^'^[Vs//^] = ,sup xi 
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Theorem 4.1. There exists a symmetric, atom-free Nash equilibrium for the 
problem for which X^i has law F{x), where for x > 



F{x) = min 



,1 



with N = n + K{n - 1). 

Remark 4.1. The agent follows exactly the same optimal strategy as an agent 
in a different setup, where there is no penalty, but the total number of contes- 
tants is increased to N = n-\- K{n — 1). 

Proof. Denote by v the joint distribution of and Afp and denote by 

G(x) the marginal distribution of X^;. Then using the strong Markov property 
and the martingale property of X , 



v{[Q,x] X [0,2/]) = nxu < ^^Mi^'M},] ^y^^ L ^^K'M^] ^ y\^r' = ^)G{dz) 

y- z , 



-{Hl,<Hl\Xl = z)G{dz) 



y 



-G{dz) 



(4) 



where Wy = inf{i e R+ : Xl = y\. 

Suppose that the other players all choose F{x) as their target distribution 
of Xt . Substituting (U) into ([3]) , the expected payoff of player i becomes 



OO /'CO 



[(1 + K)F(x)'''-^ - KF{yY-^\ iy{dx, dy) 

"'0 

I (1 + K)F{xy'-^G{dx) - / KF{yY'-^—dyG{dx) 

Jq Jx y 

F{yr-^ 



{l + K)F{x) 



n-l 



Kx 



y 



-dy 



G{dx) 



max 



Given other players' choices, player i would like to choose G <E Aio solve 

F{yr-^ 



/ {l + K)F{x)'^-^ - Kx 

Jo I Jx 



-dy 



poo poo 

G{dx) subject to / xG[dx) — xq and / G{dx) 

Jo Jo 
(5) 



= 1. 



Introducing multipliers A and 7 for the two constraints, the Lagrangian for the 
optimization problem ([S]) is then 



£f{G;X,^)^ 



(1 + K)F{xy'-'^ - Kx 



dy - Ax - 7 

y 



G{dx)+Xxo+j. 



On [0,(X))let G'*(a;) = min|l, "'{/x/iNxo)^ A* = l/(A^a;o) and 7* = 0, where 
N = n + K{n — 1). It is easy to check that G* G Ad{xo). Moreover, 

G*(2/)"-i 



£g*(G;A*,7*) 



{l + K)G*{xY 



Kx 



-dy — X a; — 7 



G{dx) + A* xo + 7* 



Nxo 



Nxo 



G(dx) + l<l = /:G-(G*;A*,7*). 
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Hence, by the Lagrangian sufficiency theorem (Proposition 12. ip G* is a sym- 
metric, atom-free Nash equiUbrium. □ 



Remark 4.2. In this version of the problem, the stopping decision depends on 
the current value of X alone, and not on the current maximum. This is because 
the penalty depends on the future maximum, which conditional on the current 
value of the process is independent of the past maximum. 

5 Contest with regret over past failure to stop 

This section discusses the contest with regret over past failure to stop, that is 
player is penalised when she could have won if she had stopped sooner. This 
case is relevant when the omniscient being can only observe the realisation of 
up to the stopping time chosen by the agent. In this case the maximum 
value is defined by 

AP := M;. = sup Xi 

0<t<T» 

Consider the problem facing a single agent under the assumption that the 
strategies of the competing agents are fixed. Temporarily we drop the subscript 
denoting the label of the agent. Recall that the payoff to the agent is (1 -|- 
K)F{Xr)"'~^ — KF{Mr)"'~''' ■ For a continuous martingale Kertz and Rosier [i] 
characterise all possible joint laws of {Xr, Mr) and hence the problem is reduced 
to a search over measures with these characteristics. However, an alternative 
is to split the optimisation problem into a two-stage procedure: first for any 
feasible distribution of X^ (a non- negative random variable with mean xq) find 
the joint law of {Xr, Mr) for which Mr is as small as possible in distribution — 
such a joint law exists by results of Perkins [5 — and then minimise a modified 
objective function over feasible laws of Xr- 

For a given law of Xr the joint law of {Xr, Mr) for which Mr is minimised 
is such that mass is placed only on the set A = {{x, x);x > xo}U {(x, $(a;)); x < 
xo} where : (0, xq) ^ {xq, oo) is a decreasing function (and if Xr is atom-free, 
a strictly decreasing function). Let cj) be inverse to $. Then, if G denotes the 
marginal law of X, we can conclude from Doob's submartingale inequalit}]^, in 
conjunction with the set identity {Mr > m) = {Xr > m) U {Xr < (j>{m,)), that 
for m > Xq 

j>oc j>(f){m) 

= ¥.[m-Xr;Xr > m]+E[rn~Xr; Xr < 0(m)] = / {m-y)G{dy)+ / {m-y)G{dy) 

Jm Jo 

(6) 

which, smcc Xr has mean xo, is equivalent to 

= m - XQ + {m - (j){m))G{(j){m)) - G{y)dy. (7) 

J 0(m) 

■^By the final remark of Section 12.21 wc may assume {XtAT)t>0 is a martingale, and then 
we have, for m > xo, mP(Mr > m) = E[Xt; Mr > m\. Hence = E[m - Xr] Mr > m]. 
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In differential form, assuming G and (j) are differentiable, this becomes 



= (t>'{m){m - (t>{m))G' {4>{m)) + 1 + G(<?!)(m)) - G(m). (8) 

It follows from the results of Perkins |S] and Hobson and Pedersen that if 
G is the law of an atom-free non-negative random variable, then there exists a 
decreasing function (f) solving Further, if f is a random variable such that 
for s> xq 

G{du) \ 



> s) = exp ( - y 



1 - G{u) + G{ct^{u)) J 

and if r = A T0 where — mi{t > 0\Mt > £,} and = inf{t > Q\Xt < 
(j){Mt)}, then Xt has law G and {Xt,Mt) places no mass off A. Moreover, 
amongst the class of joint laws for (Xt-,M^) such that Xr has law G, Mr is as 
small as possible in distribution. 

Theorem 5.1. Suppose there exists a finite real number r > xq, a once differ- 
entiable strictly decreasing function <j) : [xo,r] i— [0,a;o], a thrice differentiable 
strictly increasing and strictly convex function tp : [xo,r] i— > [0,1] and a once 
differentiable strictly decreasing function 9 : [xo,r] [0, 1] such that (j), -0 and 
9 solve the following system of equations 



(t) 



cj>'{y)i^\y)^{l + K)9\y), (9) 

K^'{y)^{v-ct>{y))r{y). (10) 

^—^9'{y)^U{y)^^-l)9{y)^^^9{y)^ (11) 
rt — 1 V / 



and satisfy that (j){xo) — Xq, i^ir) — 1, tp'{r—) — , ip"{r~) — ^^'^^^'^ and 
9{xn) = ip{xo)- 

i) Then 

9(y)^ib(y) ^ . (12) 

ii) Moreover, there exists a symmetric, atom- free Nash equilibrium for the 
problem for which X^^i and Al^i have joint law v* that is determined by the 
marginal distribution G* of X^i, given by G* (x) = for a; < 0, G*{x) = 1 for 
X > r and 

n*( \ /^(0"'(a^))^ , ifQ<x<xa, , 
G . (13) 

h/;(a;)"-i , ij Xq < x < r, 



otherwise, and the conditional distribution of APi given X^i such that 



, ^fX'^,>xo, 
0-1 (X;0 , tfO<X^,<xo. 



Proof. The conditions in the theorem imply some properties of function </>: let 
y — r— in (fTO]) then since ?A'(r— ) = ^^-^ and i]y'{r—) = ^^^■i'^'' we have 
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(f){r) = 0; since ([TU)) holds and by the positivity of tp' and ip" we have (j>{y) < y 
on {xo,r). 

i) Integrating ^ with respect to y, 

{K+l)e{z)-{K+l)e{xo) = f <l>'{y)tP'{y)dy^^{z)^'{z)~4>{xo)ij'{xo+)- f <l>{y)^" {y)dy . 

(14) 

Rearranging (fTO)) and integrating, 



0(y)^"(y)d2/ = / y^"{y)dy- / X7/;'(2/)dj/ = zV''(^) - a;oV''(a^o+) - (1 + ^)V-'(^) + (1 + i^)V'(a;o). 



Combining this equation with (jl4p we find 



^(j;) = ;^^('^(y)-y)V^'(2/) + ^(2/)- (15) 



Then substituting (|TOl) into ([15]), (|T2j) follows. 

ii) Let £{xo) be the set of measures v{dx,dy) on [0,oo) x [0,cxi) such that 
i/{dx, dy) has no mass on {(x, y) : y < x or y < xq}, and let £d{xq) be the subset 
of £{xq) corresponding to probability measures of a pair of random variables 
X <Y such that X is a continuous random variable with mean xq and E[X — 
z]Y > z] — Q for all z > xq. Note that the last equation comes from the Doob's 
submartingale inequality, applied in the continuous martingale case. 

Suppose that the other players all choose F{x) as the target distribution of 
Xr- Then the aim of player i is to choose v £ £{xq) to solve 

max (/ / [{1 + K)F{xY'-^ ~ KF{yy'-^v[dx,dy)\ (16) 

subject to xv{dx,dy) = xo, iy{dx, dy) = 1 and J!^qJ^^{x- 

z)v{dx, dy) = Vz > xq. 

Introduce multipliers A and 7 for the first two constraints, and for each z > 
Xo introduce a Lagrange multiplier rj (z) for the last constraint: the constraint 
becomes Sz=xo {^(•^)(^ ~ z)dz}v{dx,dy) = 0. Then the Lagrangian for 

the optimization problem (|16p is 



Cf{v;\i,v) (17) 



fV 

(1 + if)F(x)"-^ - KF{y)"--' - Ax - 7 - / r]{z){x - z)dz 

J Xo 

By a simple extension of the Lagrangian sufficiency theorem given in Proposition 
I2.1l we have that 

Proposition 5.1. If v* , A* , 7* and rf exist such that v* £ ££)(xo) and 

Lg' i'^* ■:X*,1\V*)> {v; X\r,v*) (18) 

for all V G £{xo), where G*{x) = v*{[0,x] x (0,oo)), then v* is a symmetric, 
atom-free Nash equilibrium. 



v(dx, dy) + Axq + 7. 
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On [0, oo) X [0, oo) let be the joint law given in the theorem and G* be 
its marginal distribution with respect to Xr- In particular, G* is given by (IT3)) . 
Let A* = ip'ixo+), 7* = ^(a;o) - xqiIj' {xo+), i]*{y) = -0"(y) for xq < y < r, and 
^*(2/) = for y > r. We will show that for these multipliers ^TE\\ holds and that 
u* e Soixo). 

To prove v* G ££)(xo), we need to show G*(0) = 0, hm^^-f oo G* (y) — 1, 
G*{y) is continuous and non-decreasing, Jp uG*{du) = xq and J^^q Jy^^ix — 
z)i'*{dx, dy) = for all z > xq. 

Letting y = r— in p^ . wc find 6'(r) = 0. Then since (/)(r) 0, we have 
G*(0) = 6'((/)-i(0))i/("-i) = 6l(r)i/("-i) = 0. Moreover, limy^ooG*(?;) = 1 
follows from the finiteness of r. We have (f>~^{xo) = xq and hence G* is con- 
tinuous at Xq . Since both cj) and 6* are decreasing and continuous on [a^o , r] , 
G* (x) = 6((t>~^{x)y^'^''^~^^ is increasing for x e [0,xo]. Then, since ^(y) is in- 
creasing on [xq , r] , G* is continuous and non-decreasing on the whole interval 
of [0,r]. Note that this implies r = sup{a; > : G*{x) < 1}. 

For y > xo we have G*{(j){y)) = 6i(y)i/(""i) and so 0'(?;)(G*)'(0(?;)) = 
6l(y)i/("-i)-i6)'(y)/(n- 1). Then, using ([TT]) 

<^'(y)(y-0(y))(G*)'Wy)) = e{y)^y^^e\y) = v(y)^-i-%)^ = G*{y)-i-G*{<p{y))- 

n — 1 

Hence 

0'(y)(y - 0(y))(G*)'(^(y)) + (l - 0'(y))G*(^(y)) = G*(y) - 1 - 0'(2/)G* (</>(y)), 
and integrating from a; to r 

-{x-<j>ix))G*iq^ix)) = ~ir~x)+ f G* {y)dy + j'^^''^ G*{y)dy. (19) 



Then, setting a; = xo we recover xo ~ {1 — G* {y))dy so that a random variable 
with distribution function G* has mean xq. 

Finally, from its construction we have that v* only puts mass on A. Hence, 
from ©, 

OO f4'i^) 

{x- z)iy*{dx,dy) = {x-z)G*{dx)+ (x - z)G*(dx) = 0. 

x—0 J y—z Jo J z 

Now we prove that ^ holds. Leti*(x,y) = (l+i^)G*(x)"-i-i^G*(y)"-i- 
A*x-7*-/^^^^ r]*{z){x-z)dz and then Cq' {v; X*,j*,r]*) = L* (x, y)i^{dx, dy)+ 

A*xo+7*- 

For notational convenience, extend the domain of ip to [0,r] by defining 
V'(x) - 0{(t>-Hx)) for X € [0,xo). Then, for x < xq, = ''gl'igij = 

^ ''^+1'"'*'' > where the last equality comes from ©. Moreover tp" (x) = 
(i+K)t' i^-Hx)) Thus V' is increasing on [0,r], iIj"{x) < if x S (0,xo) and 
'0"(x) > if X e (xo,r). 
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Fix y £ [xo, r]. For any < x < j/, 

L*{x,y) = {l + K)i;{x)-K^b{y)-^y{xo+)x-iP{xo) + xo4''{xo+)- I ^"iz)ix 
= il + K){^jix)-^{y)) + {y-x)^P'{y). 

Weh&vcL*{cf,{y),y) = {1 + K){d{y) - i^{y)) + {y - ^{y))^^' (y) and then by ^ 
and (Uni), L*{(j){y),y) = 0. It is also clear that L*{y,y) = 0. 
Differentiating (l20l) with respect to x, 



or* r * 

^(x, y) - (1 + i^)V'(x) - ^{x. y) - (1 + 

Then^(x,2/) < on (0, xq) and f^(a;, y) > Oon(xo,y). Since ^^{(j){y),y) = 
{l + KW{<i,{y))-i,'{y) = {l + K)^-^'iy) = and = i^V'(y) > 0, 

it follows that ^{x,y) > if x e {0,cj){y)), ^{x,y) < if x € (</)(?/), x) and 
^{x,y) > if X e {x,y), where x S (xo,y) is such that ^(x, y)|a;=s = 0. It 
follows that L*{x,y) < for x e [0,(t){y)) U ((/)(y),2/). 

Now fix 2/ > r. For any < x < y, and writing tp^x) = ip^x) — x/r, 



L*{x,y) = {1 + K)^Ij{x) - K - 'iiy{xo+)x~ iljixo) + xo^|y{xo+) - / ^p" {z){x - z)dz 

= (1 + K){tlj{x) - 1) + (r - x)V''(r-) = {I + K) (tP{x) - ^) (1 + K)i){x) 

If X G (r, y] then L*{x, y) = (1 + K)^[r — x] < 0. Now suppose x g (0, r). Since 
^(r) = 0, we have V'(0+) = ^ and thus Vi'(0+) = 0. Further i^'{r-) = f > 0. 
Then by the sign of ip"{x), we get ip'{x) is negative and then positive on (0,r). 
Since ip{0) = ip{r) = 0, we deduce that ip{x) < on (0,r). Thus L*{x,y) < 
for X G (0, r). 

From above analysis, we know L* (x, y) < for any (x, y) such that < x < y 
and y > xq. This means that e £{xo) 

/■oo /"OO 

Cg'{v\\* n\ri*)^ / i*(x,y)z/(dx,dy) +A*xo+7* 

Jo "'0 



oo /"OO 



L*{x,y)vidx,dy) + ^(a^o) < ^(a^o) = -Cg- (j^*; A*, 7*, ?]*) 

/o Jo 

Thus v* is a symmetric, atom- free Nash equilibrium from Proposition 15. II □ 

It remains to show that there exists a constant r and functions (</), 0, ip) which 
satisfy the hypotheses of the Theorem 15.11 and hence that a symmetric atom- 
free Nash equilibrium always exists. The following lemma is key in defining the 
appropriate entities. 

Lemma 5.1. Let J{u) solve the ordinary differential equation 

^ ^~ (i^+i)[i-u- ^ ' 
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subject to J(0) = andu>0. Let u* = sup {u : J{u) < (1 - u)^/^"-^)}. 

i) Define 

^ ' {K + l)[z-J{l-zY-^] 

on [-2*,!], where z* = 1 — u* . Then z* > 0, H is positive on (-2*,!) and 
/^%xp (^JlH{v)dv)dw<{K + l). 

ii) Define 

xo{K + 1) 

{K + l)-J^^e^p[jlHiv)dv)dw 

and 

^' K + 1 



{K + 1)- exp(^J H{v)dv^ dw 



on [z*,l]. Let ip = ^/ ^ be the inverse function of 5*. Then Xq < r < oo 
and ip : [xo,r] i-)- [0,1] is a strictly increasing and strictly convex function that 
satisfies %b{r) = 1, ^'(r-) = ^ and V"(r--) = ^^f/^^ ■ 
Hi) Define 

*<»)--^- 

Then (j) : [xQ,r] [0, a;o] is a strictly decreasing function with ^{xq) = xq. 
iv) Define 

0{y) = ^(xo) + r ct>'{z)i,'{z)dz 

Then : [xo,r] h- > [0, 1] is a strictly decreasing function with 0{xo) = iI)(xq). 
Moreover, e{y) ^ ijj{y) ^ {y ^ 4>{y))ij' (y) / {K + 1). 

Proof. It is easily seen that J{u) is a strictly increasing function at least until 
J{u) = ""-v/l — u, and that u* < 1. 

i) Since u* < 1, z* = 1 — u* > 0. Since J is increasing, for any u E (0,^*) 
J(u)"-i < J(u*)"-i and thus 1-u- J{u)"~^ >l-u- J{u*)"~^ = u* - u. 
This means for any z & [z* , 1) we have z — J(l — z)^~^ > z — z* > Q and then 
< ^(^) < iK+if{z-.') - Moreover, 

/ exp f / — — ^ -dv\ dw = [ exp(—J^ — In^^ — —] dw = [ (— — ^| dw 

Jz' "^KJ^ {K + l){v-z*) J J,, w-z*J J,,\w-z*J 

= (1 - z*)T^{l - z*)T!TT{K + 1) = (1 - z*){K + 1)<{K+ 1), 

and it follows that J^, cxp ^ H{v)dv^ dw < {K+1). 

ii) Since < {K+l)-^l, exp (^/J, H{v)dv^ dw < {K + 1), we have that xq < 

r < oo. Taking derivatives of ^ on (z*, 1), we find '^'{z) — -j^-pi exp ^ H{v)dv^ > 
and = < 0. Then since = xq and * (1) = r, * is 
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a strictly increasing and strictly concave function from [z*,l] to [xo,r]. Thus 
ip = ^I^"^ : [xo,r] > [0, 1] is a strictly increasing and strictly convex function 
satisfying ?/>(r) = 1. 

Moreover, ^'(y) = wj^kyT) ^^^^ '^"^^^ " '^-WW^j^- ^hen since 
^{1) = r, vl,'(l-) = ^ and = -p^, we get i^ir) = 1, V/(r-) = 

1 _ K+l „„j ^ _ ^"{l-)i:'{r-) _ K{K+1) 



iii) Letting u = 1 - z in dUl), we get J'(l - z) = Ik+i)[z-j{i-z)-^-\ 



K 



Thus, 



H'{z) 



K[l + {n - 1) J(l - - z)] 

(K + l) [z - J(l - zY-^f 



K 



l + [n-l)J{l-zr-^^[j{l 



z) + 1 



^^Hizf - (^^±il^i/(.)3 [j(l - + ,7(1 - (l - 

(23) 



It is clear from the middle line above that ^rrr^ < — 

Note that (t)'{y) = 1-K (l - ^^i^Mg^^ . Since H{z) = 
we have that 

Him) - (24) 

andthusi/'(^(y))V;'(2/) = Thus l-^^^^l™ = ^H'my))p^ 

1 and therefore 

It follows that (j)'{y) <0. 

Since tp^r—) — ^^r-, i^'iy) > and '0"(?/) > on (xq, r), we know tp'{xa+) is 
bounded. Then, since = H{iI}{xq)) = H{z*) = +oo, we get il}"{xa+) = 

+00. Substituting these values into ([22]) we obtain (/i(a;o) = xo- 

iv) The statements about 6 are either trivial, or follow as in the derivation 
of (HI]). 

□ 



Theorem 5.2. Lei r, t/j, <j), 9 he as defined in Lemma \5.1\ 

Then there exists a symmetric, atom-free Nash equilibrium for the problem 
for which XJ^i has distribution F where F{x) = for x < 0, F{x) ~ 1 for x > r 
and otherwise 

^ 9{(f)^^{x))'^ ifO<x<xo, 
"0(0;)"^ if xo < X < r. 



F(x) = 
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Proof. By m and dH, = ?^ = ^^i^'iv)- Hence ^(y) = ^(y) 

= '^(^ " V-'(2/))""^- Thus letting z = iPiy) in 



and using (|25p and rearranging above equation, 
y-<j){y) (j)' {v)%l)' [y) _ 



n - 1 



X + 1 



%)+ l-^(y)T^^(y) 



Then substituting © into above equation, pT|) foUows. Therefore, (f) holds. 
Then using Theorem l5.ll we obtain the symmetric, atom- free Nash equilibrium 
given in Theorem 15.21 □ 



Example 5.1. As an example we consider a 3-player contest. Set xq = 1 and 
n = 3. In Figures ?? and ?? we give graphs of the optimal distribution G*{x) 
and its density function g*{x) for various values of K. 

Figure 1 here. 

Figure 2 here 

As we can seen in Figure ??, the right endpoint r = r{K) of G* {x) decreases 
as K increases. Moreover r{K) tends to n = 3 as if decreases to and tends 
to xq as K increases to +00. We also find that G*{x) tends to the equilibrium 
distribution of the original contest as K decreases to and G* (x) tends to the 
Heaviside function ^{xyxo} as K increases to +00. From Figure ??, we find 
g*{x) jumps at xq if if > and g*{x) tends to +00 as y tends to 0. 

Intuitively, if K is very large then the player does not aim for large values 
of the stopped process, for then she risks a moderate value of the maximum 
together with a small and losing value for the stopped process. Because of the 
large penalty she wishes to avoid such outcomes. 

Example 5.2. In the 2-player contest we can give explicit expressions for several 
of the quantities of interest. 

Set n = 2. Substituting dH), ^ and ([12]) into we get 

(y - Hy))T^'l^'iy)i^'(.y) = Wy) - 1) - m + f,.^'f}^U \y)- 



l + K 



K+1 K 



Defining (p{y) = y — 4>iy) the above equation simplifies to ip{y)ip'[y) — 
Differentiating this expression and using (|10l) we have 



4''(y) ' 



MyWiy)r -iK + 1)-^ = -K,'iyl 



and then 



'P{y)f'{y) = -Kifiiy) + Kip{r) + (p{r)ip' (r-). 



(26) 
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Since (^(r) = r and ^p'{r—) = we have ip'(r—) = 1. Then becomes 
if(i/)if"{y) = —K(p(y) + r{K + 1), which using the boundary condition tp{r) = r 
has solution 



Using ip{xo) 



V 



r{K + 1) 



(t>{xo) — 0, we find 
r = xo- 



ln({K + 1) 



if2 



K(p{y)\ r-ip{y) 



K 



'{K + l)[K-\n{l + K)] 
and therefore the implicit form of ^{y) for y G [a;o,r] is 



/s: X-ln(l + X) 



In 



1 - f{v) 



K -\xi{l + K) 
Kxo 



and ipiy) — y ^ v{y)- It is possible to express ip and 6 in terms of ip, and thence 
the optimal distribution G* of X% and the optimal conditional distribution of 
ikPi given X*;, but these expressions are not so compact. 



6 Contest with regret over failure to stop at the 
best time 

In this section we discuss the contest with regret over failure to stop at the best 
time. A player experiences regret if she could have won if she had stopped at 
the maximum value over the whole path. The maximum value M* is given by 

:= M\j, = sup XI . 

° 0<t<Hl 

Theorem 6.1. There exists a symmetric, atom- free Nash equilibrium for the 
problem for which X^i has law F{x), where for x >Q 

Proof. The agent's expected payoff is 

(1 + K)'&[F{Xl,Y~^] - ifE[F(M^.)"-i] 

But the latter term is independent of the stopping rule used by the agent. 
Hence, in determining her optimal strategy the agent need only consider (1 + 
K)E[F{XliY-^]. Modulo the initial constant, this is the same objective func- 
tion as in the standard case. □ 

Remark 6.1. The agent follows exactly the same Nash equilibrium strategy 
as an agent in the original contest, in which there is no penalty. The intuition 
behind is that the regret is determined by Mlri but player cannot change the 



distribution of ; by changing the choices of stopping time . 
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7 Derivation of the equilibrium distribution 



This section is intended to illustrate how we derived the optimal multipliers 
and the candidate Nash equilibrium in Sections [H |4] and [5] and also the bound- 
ary conditions in Section [S] The Lagrangian approach gives a general method 
for finding the optimal solution, which is distinct from the ideas in Seel and 
Strack [5], and can be generalised to other settings. 



7.1 Contest without regret or with regret over stopping 
too soon 

Recall the definition of the Lagrangian Cf{G; A, 7) for the optimization problem 
(dl and (O. (We cover the more complicated case of regret from continuing 
beyond a winning time in a separate section.) 

Denote by (2;) by the integrand in so that £i?(G'; A, 7) = Jp Lp{x)G{dx)+ 
Xxq + 7. In order to have a finite optimal solution we require Lp{x) < on 
[0, (X)). Let "Df be the set of (A, 7) such that /^^(S'^jT) has a finite maximum. 
Then Dp in defined by 

Vf = {(A, 7) : Lf{x) < on [0,oo)}. 

In order to reach the maximum value, we require G{dx) — when Lf{x) < 0. 
This means that for (A, 7) G Vf the maximum of Cf{'] A, 7) occurs at G* such 
that G*{dx) = when Lf{x) < 0. Conversely we expect that G*{dx) > 
when Lf{x) — 0. If the Nash equilibrium is symmetric then we must have 
G*{x) = F(x) and then Lc-ix) < 0, and Lg-{x) = when G*{dx) > 0. 
Introduce a — mi{x : G*{x) > 0} and b = sup{a; : G*{x) < 1} which are the 
limits on the support of G* . 



7.1.1 Contest without regret 

For the optimization problem ([T]), Lf{x) = F{x)"~^ — Ax — 7. Observe that 
< F (0)" so that if (A, 7) £ Vf then 7 is non-negative. 

Since Lg'{x) = G*{x)^^^ — Ax — 7, we must have G* (x) — ""\/Ax + 7 at 
least when G*{dx) > 0. Since we are searching for atom- free solutions we must 
have G*{x) = ""\/Ax"+7 on the whole of the interval [a, b]. Moreover, since G* 
is non-decreasing and not constant we must have A > 0. 

Since G* is atom-free, G*{a) = and hence Aa + 7 = 0. Then by the non- 
negativity of a and 7 and the positivity of A it follows that 7 = = a. Thus 
G*{x) — ""\/Aa; on [0, 6] for some A and b which we must find. 

For a feasible solution, G*{dx) = 1 and xG*{dx) — xo, so that 



Xo 

^ ' "'0 



1=1 d ( " \/Ax1 = " \/A6. Xo = I xd(^" \/Xxj = — —b^^-i = " \/A6 



Hence b = nxQ and A = l/(nxo). This gives us that G* is the distribution 
function given in Theorem 12.21 
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7.1.2 Contest with regret from stopping too soon 

Now we have that Lp {x) = (1 + K)F{xY^^ - Kx ^^^^^'^ dy - Ax - 7. 
Then Lp (0) = (1 + if)i^(0)""i - 7 and as before, if (A, 7) G Vp then 7 is 
no n- negative. 

Let il){x) ~ G*{x)'^^^ then Lg'{x) becomes 

Lg- (x) = (1 + K)i;{x) -Kx -^dy - Ax - 7. (27) 

J X y 

Thus we expect il){x) is the solution to Lg*{x) = at least when ip{dx) > 0. 
Setting Lg*{x) — and differentiating (P7)) twice with respect to x, we find 

(1 + K)iP"{x)x + K^'{x) = 0. 
Thus ^{x) = Cix ^+1 + C2, where Ci and C2 are some constants, and then 



G*{x) = " yCixK+i + C2 at least when G*{dx) > 0. Since we are seeking 



an atom-free solution we must have G*{x) = y Cix^+i + C2 on the whole 
interval of [a, b], where Ci > 0. 

Substituting ip{x) = G*{x)"^'^ = [CixtA^ + C2) A 1 into 1^, and setting 
Lq* (x) = we have Vx G [a, b] 







/ 1 \ f G^yT^ + C? f°° 1 

(l + K)(Cix— +G2]-Kx -L^dy~Kx dy - Xx - -f 



(1 + K)GibT^ + KG2 -K-Xb ^ + C2-7 



Jb y 

-4- 

b 



This gives us optimal multipliers 7* = C2 and A* = i (1 + K)Gib'<+^ + KC2 - K 

Since G* is atom-free, G*(a) — and hence Cia '^+^ + G2 =0, and from 
the non-negativity of a and 7* — G2 and the positivity of Ci it follows that 



G2 = a = 0. Thus G*{x) = " y C'lX"'^ on [0, b] for some Ci and b which can 
be identified using the fact that G* corresponds to a probability distribution 
with mean xq. In particular, setting = 1 + {K + l){n — 1) for a feasible 
solution, 

1 = "^CixiTTT^ = "-VCi6V(i^+i), 



Hence Ci = 5-i/(^"^'+i) and then 6 = iVxo and Ci = jj-- Thus G*(x) 



-{/xJNx^ on [0,7Vxo]. 
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7.2 Contest with regret over past failure 

Recall the definition of the Lagrangian Cpiy] A, 7, rf) for the optimization prob- 
lem of Section [5] Let Lp [x, y) be the integrand in the definition of Lp as given 
in pTl) . In order to have a finite optimal solution we require Lp{x,y) < on 
[0,00) X [xojOo). Let Vp be the set of (A, 7, 77) such that Cf{-; X,'^,r]) has a 
finite maximum. Then TJp is defined by 

T^F = {(A, 7, 77) : Lp{x,y) <0;x>0,y> xq}. 

For (A, 7, 77) G Vp the maximum of A, 7, 77) occurs at a measure v* such 
that i'*{dx, dy) — when Lp{x, y) < 0. Conversely we expect that h'*{dx, dy) > 
when Lp{x,y) — 0. 

Let G* {x) — v*[{{u^y) : u < x,xq < y < 00}) be the marginal of v* . 
If the Nash equilibrium is symmetric then we must have G*{x) — F{x) and 
Lg* {x, y) = when i^* {dx, dy) > 0. Motivated by the results of previous sections 
we expect G* to place mass on an interval [a, b] where — a < xq < b. In this 
section we write b = r for the upper limit. 

It follows from the discussion before Theorem IS . ll that for an optimal solution 
either X'\ — A/% or X\ = 0(M\) for some decreasing function cj). Hence, for 
Xo < y < r we expect v*{dx,dy) > if and only if either x ~ y or x — 4>{y). 
Let '4){x) = G*{xY~^. Then LG*{x,y) becomes 

L{x, y) := Lg* (a;, y) = [I + K)Tpix) - K^l){y) - Ax - 7 - f ri{z){x - z)dz. 

Jxa 

Fixing y G {xo,r), and using L{x,y) < for any < x < y, together with 
L{(j){y),y) = we expect |§(</)(y), y) - 0. 
Thus Vy G (a;o: "0 ^-nd (f) must solve 

Hy. y) = Hy) -^y--t- T v{z)iy - z)dz = o, (28) 

J Xq 

L{<l>{y),y) = (1 + if )V(</>(y)) - Ki^iy) - X<P{y) T 77(z)(<^(y) - z)dz ^29) 

J Xo 

— (0(y),y) = (l + W(<^(y))-A- / 77(z)dz = 0. (30) 

OX J Tr. 



Differentiating (|28)) with respect to y, yields 

^'{y)-\- [\{z)dz = Q. (31) 



Comparing (|30l) with ([3T|) . we find V'(?/) = (1 + {4'{y)) ■ If we now set 
61(7;) = -0(0(7/)), then dH) follows. From ([ST]) we find 

V'"(2/) - ri{y) = 0, (32) 
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Then, differentiating (1^ with respect to y, and using ([50)) we obtain —K'tp'{y) — 
v{y){4>{y) ~ y) = 0; and (|10p . Finally, ([TT|) comes directly from ([8]) on noting 
that G(0(m)) = 6i(m)i/("-i) . 

Next we deduce the boundary conditions. First note that from ^ we can 
infer that 4'{xo) = xq and (j){r) = 0. Hence 9{xo) = ^(2^0) and 9{r) = 0. 
Given that ^ and pUl) hold, as in the proof of Theorem 15. 1[ we have that 
(fT5|) holds. Letting y = r and using ■(/'(r) = 1, (/)(r) = and 9{r) = 0, we find 
= -^^ip' i^—) + iP{t) and hence ip'^r—) = Further, letting y ~ r \n (|10p 

we get ^p"{r—) = K{K + l)/r^ as required. 

Lastly, we derive the optimal multipliers which we write as 77*, A* and 7*. 
From r)*{y) = iP"{y) for y S (xo,r). Then, from dST]), A* = ^p'{y) - 

/J^ r,*iz)dz = ^'{y) - 11 r{z)dz = V'(a^o+)- Finally m yields. 



7* = / n* {z){y- z)dz = '0(y)-?/V''(a;o+)- / fp" iz){y-z)dz = ■)jj{xo)-xo'ip' {xo+). 
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